Dynamic Documents Indexing in Evolving Contexts

نویسندگان

  • Rolf Nossum
  • Vladimir Oleshchuk
چکیده

In this paper we consider similarity between documents in contexts defined by ontologies. Two documents have different similarity degree depending on the context in which they are considered. We consider a scenario where context evolves incrementally, necessitating continual revision of similarity measures between documents. We develop algorithms that revise similarity between documents concurrently with operations updating the ontology.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Efficient semantic indexing via neural networks with dynamic supervised feedback

We describe a portable system for e cient semantic indexing of documents via neural networks with dynamic supervised feedback. We initially represent each document as a modified TF-IDF sparse vector and then apply a learned mapping to a compact embedding space. This mapping is produced by a shallow neural network which learns a latent representation for the textual graph linking words to nearby...

متن کامل

Exemplary documents: a foundation for information retrieval design

Documents are generally represented for retrieval by either extracting index terms from them or by creating and selecting from an external set of candidate terms. There are many procedures for doing this, but while work continues along these dimensions, there have been relatively few attempts to change this basic process. Of particular importance is the creation of indexing schemes for retrieva...

متن کامل

Efficient Dynamic Indexing and Retrieval of XML Documents using Three- Dimensional Quasi-BitCube

XML is a new standard for exchanging and representing data on the Internet. Techniques for indexing and retrieval of XML data is drawing increasing attention since they enable one to access certain parts of retrieved documents easily. However, they provide little or no support for adding new documents to an existing document collection, requiring instead that the entire collection be re-indexed...

متن کامل

WordSieve: A Method for Real-Time Context Extraction

In order to be useful, intelligent information retrieval agents must provide their users with context-relevant information. This paper presents WordSieve, an algorithm for automatically extracting information about the context in which documents are consulted during web browsing. Using information extracted from the stream of documents consulted by the user, WordSieve automatically builds conte...

متن کامل

Context based Web Indexing for Storage of Relevant Web Pages

A focused crawler downloads web pages that are relevant to a user specified topic. The downloaded documents are indexed with a view to optimize speed and performance in finding relevant documents for a search query at the search engine side. However, the information will be more relevant if the context of the topic is also made available to the retrieval system. This paper proposes a technique ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2005